Automatic Slide Presentation From Semantically Annotated Documents

Authors

  • Masao Utiyama
  • Koiti Hasida
Abstract

This paper discusses how to generate slide shows automatically. The presentation system takes as input documents annotated with the GDA tagset, an XML tagset that lets machines automatically infer the semantic structure underlying the raw documents. The system selects important topics in the input document on the basis of the semantic dependencies and coreferences identified from the tags. Topic selection also depends on interaction with the audience, so the presentation adapts dynamically. A slide is composed for each topic by extracting relevant sentences and paraphrasing them into an itemized summary. Heuristics are used for paraphrasing and layout. Because the GDA tagset is independent of the domain and style of documents and applicable to diverse natural languages, the system is likewise domain- and style-independent and easy to adapt to different languages.
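The topic-selection and slide-composition steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The sample markup is a simplified, GDA-style assumption (`su` for a sentence, `id` to name an entity, `eq` to mark a coreferent mention), and the scoring rule (rank entities by how many sentences mention them) is a stand-in for the paper's dependency- and coreference-based selection. The function names `rank_topics` and `make_slide` are hypothetical. Audience interaction and paraphrasing are not modeled.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical GDA-style input (assumption): <su> marks a sentence,
# `id` names an entity, `eq` points to the id of a coreferent entity.
SAMPLE = """<doc>
<su><np id="gda">The GDA tagset</np> annotates <np id="doc">documents</np>.</su>
<su><np eq="gda">It</np> encodes semantic structure.</su>
<su><np eq="doc">They</np> can be in any language.</su>
<su><np eq="gda">The tagset</np> is domain independent.</su>
</doc>"""


def rank_topics(xml_text):
    """Return (topic label, sentences) pairs, most-mentioned entity first."""
    root = ET.fromstring(xml_text)
    mentions = defaultdict(list)  # entity id -> indices of sentences mentioning it
    labels = {}                   # entity id -> text of its first (defining) mention
    sentences = []
    for i, su in enumerate(root.iter("su")):
        # Flatten the sentence to plain text and normalize whitespace.
        sentences.append(" ".join("".join(su.itertext()).split()))
        for el in su.iter():
            ref = el.get("id") or el.get("eq")
            if ref is None:
                continue
            if el.get("id"):
                labels[ref] = "".join(el.itertext()).strip()
            if i not in mentions[ref]:
                mentions[ref].append(i)
    # Stand-in heuristic: more mentioning sentences first, earlier mention breaks ties.
    ranked = sorted(mentions, key=lambda r: (-len(mentions[r]), mentions[r][0]))
    return [(labels.get(r, r), [sentences[i] for i in mentions[r]]) for r in ranked]


def make_slide(title, items):
    """Lay out one slide as a title followed by an itemized list."""
    return "\n".join([title] + ["  - " + s for s in items])


if __name__ == "__main__":
    for title, items in rank_topics(SAMPLE):
        print(make_slide(title, items))
        print()
```

Each extracted sentence appears verbatim as a bullet here. The system described in the abstract additionally paraphrases the sentences into a shorter itemized summary.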

Similar resources

Approaches for Annotating Medical Documents

Annotations are useful to semantically enrich documents and other datasets with concepts of ontologies. In the medical domain, many documents are not annotated at all and manual annotation is a difficult process making automatic annotation methods highly desirable to support human annotators. We propose a linguistic-based and a reuse-based approach annotating medical documents by concepts from ...

Utilization of Semantic Annotations in Interactive User Interfaces for Large Documents

With new techniques, such as Microformats or RDFa, for integrating semantics into existing web formats, we expect a strong increase of semantically annotated documents in the web. This paper describes a new approach for utilizing semantic annotations to improve the user interface for large text documents by guiding the user’s attention to semantically annotated text sections using interactive f...

A Framework for Modular Semantic Publishing with Separate Compilation and Dynamic Linking

We present the Active Documents approach to semantic publishing (semantically annotated documents associated with a content commons that holds the background ontologies) and the Planetary system (as an active document player). In this paper we explore the interaction of content object reuse and context sensitivity in the presentation process that transforms content modules to active documents. ...

Automatic Creation of Knowledge Graphs from Digital Musical Document Libraries

Most current musicological knowledge is found in printed books and manuscripts. In recent years, great efforts have been made to digitize these documents and make them available as Digital Libraries. However, digital documents are mainly stored as raw text, with no more structure than indexes and some metadata. Therefore, implicit knowledge contained in text is not unders...

A semantically annotated Verbal Autopsy corpus for automatic analysis of cause of death

This paper presents a method employed in building a semantically annotated corpus of 11,741 Verbal Autopsy documents, each annotated with Cause of Death, based on verbal records of deaths of mothers, stillbirths, and infants up to 1 year of age, captured for analysis in Ghana between December 2000 and July 2010. Verbal Autopsy is a technique which involves interviewing individuals (such as rela...

Publication date: 1999